Assamese Text to Speech Corpus
Dataset Description
44:49:34 (hh:mm:ss) | 28.85 GB | 32,594 audio segments | 2 speakers
The LDC-IL Assamese Text to Speech dataset comprises audio files in WAV format, each accompanied by a corresponding text layer in Assamese script. The dataset spans 44:49:34 (hh:mm:ss) of read speech recorded in a studio setup. The recordings come from two native Assamese speakers, one female and one male. A comprehensive explanation of the dataset can be found in the Assamese Text to Speech Documentation.
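The figures quoted in the description above imply an average segment length of roughly 4.95 seconds (161,374 seconds across 32,594 segments). The following is a minimal sketch, not an official LDC-IL tool, of how a user might check a downloaded copy against those figures using only the Python standard library. The function names are hypothetical, and it assumes the audio is uncompressed PCM WAV with a lowercase `.wav` extension somewhere under one root directory; the actual folder layout is described in the dataset documentation and is not assumed here.

```python
import wave
from pathlib import Path


def parse_hms(text: str) -> int:
    """Convert an 'hh:mm:ss' string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def wav_duration_seconds(path: Path) -> float:
    """Return the duration of one PCM WAV file in seconds."""
    with wave.open(str(path), "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def summarize_corpus(root: Path) -> tuple[int, float]:
    """Count the .wav files under root (assumed layout) and sum their durations."""
    durations = [wav_duration_seconds(p) for p in sorted(root.rglob("*.wav"))]
    return len(durations), sum(durations)


# Figures quoted in the dataset description.
LISTED_SECONDS = parse_hms("44:49:34")
LISTED_SEGMENTS = 32_594
AVERAGE_SEGMENT_SECONDS = LISTED_SECONDS / LISTED_SEGMENTS
```

Calling `summarize_corpus(Path("path/to/corpus"))` on a local copy returns the file count and total seconds, which can then be compared with `LISTED_SEGMENTS` and `LISTED_SECONDS`. Small differences are possible if the listed duration was rounded per file.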
For research use, please cite the following:
- Syeda Mustafiza Tamim, Prangshu Manjul, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan. 2025. Assamese Text to Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN 978-93-48633-45-3.
- Rejitha K. S. and Narayan Kumar Choudhary (eds.). 2025. LDC-IL Corpus Insights. Central Institute of Indian Languages, Mysore. ISBN 978-93-48633-33-0.
Item specifics
- Authors: Syeda Mustafiza Tamim, Prangshu Manjul, Stephen Fernandes, Nithin S., Roopashri M. R., Dr. Narayan Kumar Choudhary, Prof. Shailendra Mohan
- Corpus Type: Text to Speech Corpus
- Catalogue Number: 1514
- ISBN: 978-93-48633-45-3
- Data Source: On Field
- Duration: 44:49:34 (hh:mm:ss)
- Number of Audio Segments: 32,594
- Release Date: 3/20/2025
- Terms and Conditions: General instructions for use of the resources provided by LDC-IL.